Appl. No. 09/392,170 

Amdt. dated July 9, 2007 

Reply to Office Action of February 7, 2007 

Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings, of claims in 
the application: 

Listing of Claims: 

1. (Canceled). 

2. (Currently amended) A computer-implemented method for randomly 
walking through a hypertext-linked document set comprising a plurality of 
documents, wherein at least a subset of the documents contain a plurality of links 
to other documents, each document being associated with a host, the method 
comprising: 

a) selecting a host; 

b) selecting at random a document associated with the host; 

c) retrieving the selected document; 

d) randomly choosing whether to select a random new document; 

e) responsive to [[occurrence of a random event]] choosing to select the 
random new document: 

[[d]]e.1) selecting at random a new host from among the 
previously selected hosts; 
[[d]]e.2) selecting at random a new document associated with the 
new host; and 
[[d]]e.3) retrieving the selected new document; 

[[e]]f) responsive to [[non-occurrence of the random event]] choosing not to 
select the random new document: 

[[e]]f.1) selecting at random a link in the retrieved document; and 
[[e]]f.2) retrieving a document referenced by the selected link; 
and 

[[f]]g) repeating d), and then conditionally repeating e) or f) depending 
upon the choosing made in d). 
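
By way of non-limiting illustration, the walk recited in claim 2 might be sketched in Python as follows. The in-memory graph (`docs_by_host`, `links`, `host_of`), the function name `two_level_walk`, the `steps` bound, and the `jump_prob` parameter are assumptions introduced for this sketch and do not appear in the claims. "Retrieving" a document is simulated by a dictionary lookup, and the sketch also jumps when the current document has no links, which the claim does not recite.

```python
import random


def two_level_walk(docs_by_host, links, host_of, steps, jump_prob=0.15, rng=None):
    """Simulate a two-level random walk over an in-memory link graph.

    docs_by_host: host -> list of documents on that host
    links:        document -> list of documents it links to
    host_of:      document -> host
    """
    rng = rng or random.Random()
    host = rng.choice(sorted(docs_by_host))       # a) select a host
    visited_hosts = [host]
    doc = rng.choice(docs_by_host[host])          # b), c) pick and "retrieve" a document
    path = [doc]
    for _ in range(steps):
        # d) randomly choose whether to select a random new document
        # (assumption: also jump when the current document has no links)
        if rng.random() < jump_prob or not links.get(doc):
            host = rng.choice(visited_hosts)      # e.1) host from previously selected hosts
            doc = rng.choice(docs_by_host[host])  # e.2), e.3)
        else:
            doc = rng.choice(links[doc])          # f.1), f.2) follow a random link
            host = host_of[doc]
            if host not in visited_hosts:
                visited_hosts.append(host)
        path.append(doc)                          # g) repeat
    return path
```

Selecting the host first and the document second keeps a host with many pages from dominating the jump step, which is the apparent purpose of the two-level structure.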

3. (Canceled). 

4. (Previously presented) The method of claim 2, wherein the document set 
is the World Wide Web, and wherein each document is a web page. 

5. (Original) The method of claim 4, wherein each host corresponds to a 
domain. 

6. (Currently amended) The method of claim 2, further comprising, 
concurrently with a) through d), with e) or f), and with g), performing a second 
two-level random walk through the hypertext-linked document set. 

7. (Currently amended) A computer-implemented method for randomly 
walking through a hypertext-linked document set comprising a plurality of 
documents, wherein at least a subset of the documents contain a plurality of links 
to other documents, each document being associated with a host, the method 
comprising: 

a) initializing a host set; 

b) initializing a document set for each host in the host set; 

c) selecting at random a host from the host set; 

d) selecting at random a document from the document set of the 
selected host; and 

e) responsive to the selected document containing at least one link: 
e.1) selecting at random a link from the selected document; 
e.2) selecting a document corresponding to the selected link; 
e.3) selecting a host corresponding to the selected document; 
e.4) adding the selected host to the host set; 
e.5) adding the selected document to the document set of the 
selected host; and 
e.6) repeating e.1) through e.5) until [[a first predetermined 
condition is met]] all links have been traversed. 

8. (Previously presented) The method of claim 7, wherein: 

e.4) is performed responsive to the selected host not being in the 
host set; and 

e.5) is performed responsive to the selected document not being 
in the document set of the selected host. 
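
By way of non-limiting illustration, the host set and per-host document sets of claims 7 and 8 might be maintained as sketched below. The names `grow_sets`, `seed_docs`, `links`, `host_of`, and the `max_steps` bound are assumptions made for this sketch; the claims recite repeating until all links have been traversed, whereas the sketch simply stops after a fixed number of steps or at a document without links.

```python
import random


def grow_sets(seed_docs, links, host_of, max_steps, rng=None):
    """Follow random links, growing a host set and a document set per host."""
    rng = rng or random.Random()
    doc_sets = {}                                  # host -> set of documents; keys form the host set
    for d in seed_docs:                            # a), b) initialize host set and document sets
        doc_sets.setdefault(host_of[d], set()).add(d)
    host = rng.choice(sorted(doc_sets))            # c) random host from the host set
    doc = rng.choice(sorted(doc_sets[host]))       # d) random document from that host's set
    for _ in range(max_steps):
        out = links.get(doc, [])
        if not out:                                # e) applies only if the document has a link
            break
        doc = rng.choice(out)                      # e.1), e.2) random link and its document
        host = host_of[doc]                        # e.3) host of that document
        # e.4), e.5): dict/set semantics add the host and document only if
        # they are not already present, as in claim 8
        doc_sets.setdefault(host, set()).add(doc)
    return doc_sets
```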



9. (Currently amended) A computer-implemented method for randomly 
walking through a hypertext-linked document set comprising a plurality of 
documents, wherein at least a subset of the documents contain a plurality of links 
to other documents, each document being associated with a host, the method 
comprising: 

a) initializing a host set; 

b) initializing a document set for each host in the host set; 

c) selecting at random a host from the host set; 

d) selecting at random a document from the document set of the 
selected host; 

e) randomly choosing whether to select a random new document; and 

f) responsive to [[non-occurrence of the random event]] choosing not to 
select a random new document and further responsive to the 
selected document containing at least one link: 

[[e]]f.1) selecting at random a link from the selected document; 
[[e]]f.2) selecting a document corresponding to the selected link; 
[[e]]f.3) selecting a host corresponding to the selected document; 
[[e]]f.4) adding the selected host to the host set; 
[[e]]f.5) adding the selected document to the document set of the 
selected host; and 
[[e]]f.6) repeating [[e]]f.1) through [[e]]f.5) until [[a first 
predetermined condition is met]] all links have been traversed. 

10. (Canceled). 

11. (Original) The method of claim 7, wherein the hypertext-linked document 
set is the World Wide Web, and wherein each document is a web page. 

12. (Original) The method of claim 11, wherein each host corresponds to a 
domain. 

13. (Original) A computer-implemented method for measuring relative quality 
of a search engine index, comprising: 

a) performing a two-level random walk among documents within a 
document set; 

b) for each document encountered in the random walk, determining 
whether the document is indexed by the search engine index; and 

c) aggregating the results of b). 
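
By way of non-limiting illustration, steps b) and c) of claim 13 might be aggregated into a single coverage fraction as sketched below. The function name `index_coverage` and the `is_indexed` callback are assumptions made for this sketch; the claim does not prescribe how the per-document results are aggregated.

```python
def index_coverage(walk_docs, is_indexed):
    """Return the fraction of walk-encountered documents that the index contains.

    walk_docs:  iterable of documents encountered in the random walk
    is_indexed: callable(document) -> bool, a stand-in for querying the index
    """
    docs = list(walk_docs)
    if not docs:
        return 0.0
    hits = sum(1 for d in docs if is_indexed(d))   # b) per-document check
    return hits / len(docs)                        # c) aggregate the results
```

Running the same walk against two indexes and comparing the two fractions gives a relative, rather than absolute, measure of index quality.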

14. (Currently amended) The method of claim 13, wherein at least a subset of 
the documents contain a plurality of links to other documents, each document 
being associated with a host, and wherein a) comprises: 

a.1) selecting a host; 
a.2) selecting at random a document associated with the host; 
a.3) retrieving the selected document; 
a.4) selecting at random a link in the retrieved document; 
a.5) retrieving a document referenced by the selected link; and 
a.6) repeating a.4) and a.5) until all links have been traversed. 



15. (Currently amended) A computer-implemented method for measuring 
relative quality of a search engine index, comprising: 

a) performing a two-level random walk among documents within a 

document set, by: 

a.1) selecting a host; 

a.2) selecting at random a document associated with the host; 
a.3) retrieving the selected document; 

a.4) randomly choosing whether to select a random new 

document; 

a.[[3]]4.1) responsive to [[occurrence of a random 
event]] choosing to select the random new 
document: 
a.[[3]]4.1.1) selecting at random a new host from 
among the previously selected hosts; 
a.[[3]]4.1.2) selecting at random a new document 
associated with the host; and 
a.[[3]]4.1.3) retrieving the selected new document; 
a.[[3]]4.2) responsive to [[non-occurrence of the random 
event]] choosing not to select the random new 
document: 
a.4.2.1) selecting at random a link in the 
retrieved document; and 
a.[[5]]4.2.2) retrieving a document referenced by the 
selected link; and 
a.[[6]]5) repeating a.4), and then conditionally repeating 
a.4.1) through a.4.1.3) or a.4.2) through a.4.2.2) 
depending upon the choosing made in a.4); 

b) for each document encountered in the random walk, determining 
whether the document is indexed by the search engine index; and 

c) aggregating the results of b). 

16. (Currently amended) The method of claim 13, wherein at least a subset of 
the documents contain a plurality of links to other documents, each document 
being associated with a host, and wherein a) comprises: 
a.1) initializing a host set; 

a.2) initializing a document set for each host in the host set; 
a.3) selecting at random a host from the host set; 

a.4) selecting at random a document from the document set of the 
selected host; 

a.5) adding [[the]] a host that is referenced by [[the]] a selected link to the 
host set; 

a.6) adding [[the]] a document referenced by the selected link to the 
document set of the selected host; 
a.7) responsive to the selected document containing at least one link: 

a.7.1 ) selecting at random a link from the selected document; 

a.7.2) selecting a document corresponding to the selected link; 

a.7.3) selecting a host corresponding to the selected document; 

and 

a.7.4) repeating a.5) through a.8) until all links have been traversed; and 

a.8) responsive to the selected document not containing at least one 
link, repeating a.3) through a.6), and further conditionally repeating 
a.7) or a.8), until all documents have been traversed. 

17. (Original) The method of claim 16, wherein: 

a.5) is performed responsive to the selected host not being in the host 
set; and 

a.6) is performed responsive to the selected document not being in the 

document set of the selected host. 

18. (Original) The method of claim 13, wherein each document contains a 
plurality of words, and wherein b) comprises, for each document encountered in 
the random walk: 

b.1) selecting at least one word from the document; 

b.2) performing a query on the search engine index based on the 
selected at least one word, to obtain search results; and 

b.3) determining whether the document is included in the obtained 
search results. 

19. (Original) The method of claim 18, wherein b.1) comprises selecting at 
least one word based on rarity. 
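
By way of non-limiting illustration, the rarity-based query of claims 18 and 19 might be sketched as below. The names `rarest_words` and `appears_indexed`, the `corpus_freq` frequency table, the `search` callback, and the default of three query words are all assumptions made for this sketch; the claims do not say how rarity is measured or how many words are selected.

```python
def rarest_words(doc_words, corpus_freq, n=3):
    """b.1) Pick the n words of the document that are rarest in the corpus."""
    unique = sorted(set(doc_words))                # alphabetical order breaks frequency ties
    return sorted(unique, key=lambda w: corpus_freq.get(w, 0))[:n]


def appears_indexed(doc_id, doc_words, corpus_freq, search, n=3):
    """b.2), b.3) Query the index with the rare words and look for the document.

    search: callable(list of words) -> collection of document ids,
            a stand-in for the search engine under test
    """
    query = rarest_words(doc_words, corpus_freq, n)
    return doc_id in search(query)
```

Rare words narrow the result list, so a document that the engine has indexed is more likely to appear among the results actually returned.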

20. (Previously presented) A computer-implemented method for measuring 
relative quality of a target document in a document set, comprising: 

a) performing a two-level random walk among documents within a 
document set; and 

b) determining a quality metric responsive to the number of times the 
target document is encountered in the random walk. 
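
By way of non-limiting illustration, one quality metric responsive to the number of times the target document is encountered, as in claim 20, is the target's share of all visits. The function name `visit_frequency` and the choice to normalize by walk length are assumptions made for this sketch.

```python
from collections import Counter


def visit_frequency(path, target):
    """Return the fraction of walk steps that landed on the target document."""
    if not path:
        return 0.0
    return Counter(path)[target] / len(path)
```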

21. (Previously presented) A computer-implemented method for measuring 
relative quality of a target document in a document set comprising a plurality of 
documents, wherein at least a subset of the documents contain a plurality of links 
to other documents, the method comprising: 

a) performing a two-level random walk among documents within a 
document set; and 

b) determining a quality metric responsive to the number of documents 
encountered during the two-level random walk that link to the target 
document. 

22. (Previously presented) The method of claim 21, wherein b) comprises 
determining a quality metric responsive to the number of documents that link to 
the target document, and responsive to the quality metric of the linking 
documents. 
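
By way of non-limiting illustration, the metrics of claims 21 and 22 might be computed from a completed walk as sketched below. The function name `inlink_metric`, the `links` mapping, and the optional `quality` table are assumptions made for this sketch; summing the linking documents' own metrics is only one way of being "responsive to" them.

```python
def inlink_metric(path, links, target, quality=None):
    """Score a target by the walk-encountered documents that link to it.

    path:    documents encountered during the walk
    links:   document -> collection of documents it links to
    quality: optional document -> quality metric of that linking document
    """
    linking = [d for d in set(path) if target in links.get(d, ())]
    if quality is None:
        return float(len(linking))                    # claim 21: count of linking documents
    return sum(quality.get(d, 0.0) for d in linking)  # claim 22: weight by their quality
```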

23. (Previously presented) The method of claim 21, wherein b) comprises 
determining a value for: 

$$R(p) = \frac{d}{T} + (1 - d) \sum_{i=1}^{k} \frac{R(p_i)}{C(p_i)}$$

where: 

R(p) is the PageRank of target document p; 

R(p_i) is the PageRank of document p_i; 

T is the total number of documents in the document set; 

d is a damping factor such that 0 < d < 1; 

documents p_1, ..., p_k each contain at least one link to target document p; 
and 

C(p_i) is the number of links out of document p_i. 
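
By way of non-limiting illustration, the value R(p) of claim 23 might be approximated by repeatedly applying the recited formula to every document until the values settle. The function name `pagerank`, the `links` mapping, the default d = 0.15, and the fixed iteration count are assumptions made for this sketch, and documents without outgoing links receive no special handling.

```python
def pagerank(links, d=0.15, iterations=50):
    """Iterate R(p) = d/T + (1 - d) * sum(R(p_i) / C(p_i)) over all documents.

    links: document -> list of documents it links to
    """
    docs = sorted(set(links) | {q for out in links.values() for q in out})
    T = len(docs)                                   # total number of documents
    rank = {p: 1.0 / T for p in docs}               # uniform starting values
    for _ in range(iterations):
        new = {p: d / T for p in docs}              # the d/T term
        for p_i in docs:
            out = links.get(p_i, [])
            for p in out:                           # p_i links to p
                new[p] += (1 - d) * rank[p_i] / len(out)   # R(p_i) / C(p_i)
        rank = new
    return rank
```

On a three-document cycle every document ends with the same value, since each has exactly one inbound and one outbound link.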

24. (Currently amended) A computer-implemented method for measuring 
relative quality of a target document in a document set comprising a plurality of 
documents, wherein at least a subset of the documents contain a plurality of links 
to other documents, wherein each document is associated with a host, the 
method comprising: 

a) performing a two-level random walk among documents within a 
document set by: 

a.1) selecting a host; 

a.2) selecting at random a document associated with the host; 
a.3) retrieving the selected document; 

a.4) randomly choosing whether to select a random new 
document; 

a.5) responsive to [[occurrence of a random event]] choosing to 
select the random new document: 
a.[[4]]5.1) selecting at random a host from among the 
previously selected hosts; 
a.[[4]]5.2) selecting at random a document associated 
with the host; and 
a.[[4]]5.3) retrieving the selected document; 
a.[[5]]6) responsive to [[non-occurrence of the random 
event]] choosing not to select the random new document: 
a.[[5]]6.1) selecting at random a link in the retrieved 
document; and 
a.[[5]]6.2) retrieving a document referenced by the 
selected link; and 
a.[[6]]7) repeating a.4), and then conditionally repeating a.5) 
through a.5.3) or a.6) through a.6.2) depending upon the 
choosing made in a.4); 
and 

b) determining a quality metric responsive to the number of documents 
encountered during the two-level random walk that link to the target 
document. 

25. (Currently amended) A computer-implemented method for measuring 
relative quality of a target document in a document set comprising a plurality of 
documents, wherein at least a subset of the documents contain a plurality of links 
to other documents, wherein each document is associated with a host, the 
method comprising: 

a) performing a two-level random walk among documents within a 

document set, by: 

a.1) initializing a host set; 

a.2) initializing a document set for each host in the host set; 
a.3) selecting at random a host from the host set; 
a.4) randomly choosing whether to select a random new host; 
a.5) responsive to choosing to select the random new 
host: 
a.[[4]]5.1) selecting at random a new host from among 
the previously selected hosts; 
a.[[5]]6) responsive to choosing not to select the random new 
host: 
a.[[5]]6.1) selecting at random a document from the 
document set of the selected host; and 
a.[[5]]6.2) responsive to the selected document 
containing at least one link: 
a.[[5]]6.2.1) selecting at random a link from the 
selected document; 
a.[[5]]6.2.2) selecting a document corresponding to 
the selected link; 
a.[[5]]6.2.3) selecting a host corresponding to the 
selected document; and 
a.[[5]]6.2.4) adding the selected host to the host set; 
a.[[5]]6.2.5) adding the selected document to the 
document set of the selected host; 
a.[[5]]6.2.6) repeating a.[[5]]6.2.1) through 
a.[[5]]6.2.5) until [[a first predetermined 
condition is met]] all links have been 
traversed; and 

a.[[6]]7) repeating a.3) through a.4), and then conditionally 
repeating a.5) through a.5.1) or a.6) through a.6.2.6); and 

b) determining a quality metric responsive to the number of documents 
encountered during the two-level random walk that link to the target 
document. 



26. (Previously presented) The method of claim 21, further comprising: 

c) determining a quality metric for at least one additional target 
document; and 

d) ranking the quality metric of the first target document with respect to 
the quality metrics of the additional target documents. 
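
By way of non-limiting illustration, the ranking of step d) might be sketched as below. The function name `rank_documents`, the `metrics` mapping, and the alphabetical tie-break are assumptions made for this sketch.

```python
def rank_documents(metrics):
    """Order target documents from highest to lowest quality metric.

    metrics: document -> quality metric (higher is better)
    Ties are broken alphabetically so the ordering is deterministic.
    """
    return sorted(metrics, key=lambda doc: (-metrics[doc], doc))
```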



27. (Currently amended) A computer-implemented method for randomly 
walking through a hypertext-linked document set comprising a plurality of 
documents, wherein at least a subset of the documents contain a plurality of links 
to other documents, each document being associated with a host, the method 
comprising: 

a) selecting a host; 

b) selecting at random a document associated with the host; 

c) retrieving the selected document; 

d) randomly choosing whether to select a random new host; 

e) responsive to [[occurrence of a random event]] choosing to select the 
random new host: 

[[d]]e.1) selecting at random a new host from among the 
previously selected hosts; and 
[[d]]e.2) repeating b) through d), and then conditionally e) through 
e.2) or f) through f.3) until all documents have been traversed; and 

[[e]]f) responsive to [[non-occurrence of the random event]] choosing not to 
select the random new host: 

[[e]]f.1) selecting at random a link in the retrieved document; 
[[e]]f.2) retrieving a document referenced by the selected link; 
and 
[[e]]f.3) repeating d), and then conditionally e) through e.2) or f) 
through f.3) until all links have been traversed. 

28. (Currently amended) A computer-implemented method for measuring 
relative quality of a target document in a document set comprising a plurality of 
documents, wherein at least a subset of the documents contain a plurality of links 
to other documents, the method comprising: 

a) performing a two-level random walk among documents within a 
document set, by: 

a.1) initializing a host set; 
a.2) initializing a document set for each host in the host set; 
a.3) selecting at random a host from the host set; 
a.4) randomly choosing whether to select a random new host; 
a.5) responsive to [[occurrence of a random event]] choosing to 
select a random new host: 
a.[[4]]5.1) selecting at random a new host from among 
the previously selected hosts; 
a.[[5]]6) responsive to [[non-occurrence of the random 
event]] choosing not to select the random new host: 
a.[[5]]6.1) selecting at random a document from the 
document set of the selected host; and 



a.[[5]]6.1) selecting at random a document from the 
document set of the selected host; and 
a.[[5]]6.2) responsive to the selected document 
containing at least one link: 
a.[[5]]6.2.1) selecting at random a link from the 
selected document; 
a.[[5]]6.2.2) selecting a document corresponding to 
the selected link; 
a.[[5]]6.2.3) selecting a host corresponding to the 
selected document; and 
a.[[5]]6.2.4) adding the selected host to the host set; 
a.[[5]]6.2.5) adding the selected document to the 
document set of the selected host; 
a.[[5]]6.2.6) repeating a.[[5]]6.2.1) through 
a.[[5]]6.2.5) until [[a first predetermined 
condition is met]] all links have been 
traversed; and 

a.[[6]]7) repeating a.3) through a.4), and then conditionally 
repeating a.5) through a.5.1) or a.[[5]]6) through a.6.2.6); and 

b) determining a quality metric responsive to the number of documents 
encountered during the two-level random walk that link to the target 
document; 

c) determining a quality metric for at least one additional target 
document; and 

d) ranking the quality metric of the first document with respect to the 
quality metrics of the additional target documents. 



29. (Canceled). 



30. (Currently amended) A computer program product comprising a 
computer-usable medium having computer-readable code embodied therein for 
randomly walking through a hypertext-linked document set comprising a plurality 
of documents, wherein at least a subset of the documents contain a plurality of 
links to other documents, each document being associated with a host, the 
computer program product comprising: 

a) computer-readable program code devices configured to cause a 
computer to select a host; 

b) computer-readable program code devices configured to cause a 
computer to select at random a document associated with the host; 

c) computer-readable program code devices configured to cause a 
computer to retrieve the selected document; 

d) computer-readable program code devices configured to cause a 
computer to randomly choose whether to select a random new 
document; 

e) computer-readable program code devices configured to cause a 

computer to, responsive to [[occurrence of a random event]] choosing 
to select the random new document: 

[[d]]e.1) select at random a new host from among the 
previously selected hosts; and 
[[d]]e.2) select at random a new document associated with the 
host; and 
[[d]]e.3) retrieve the selected new document; 

[[e]]f) computer-readable program code devices configured to cause a 
computer to, responsive to [[non-occurrence of the random 
event]] choosing not to select the random new document: 
[[e]]f.1) select at random a link in the retrieved document; and 
[[e]]f.2) retrieve a document referenced by the selected link; 
and 

[[f]]g) computer-readable program code devices configured to cause a 
computer to repeat the operations of d) and then conditionally 
repeat the operations of e) or f) depending on the choice made in 

31. (Canceled). 

32. (Previously presented) The computer program product of claim 30, 
wherein the document set is the World Wide Web, and wherein each document is 
a web page. 

33. (Canceled). 

34. (Canceled). 

35. (Currently amended) A computer program product comprising a 
computer-usable medium having computer-readable code embodied therein for 
randomly walking through a hypertext-linked document set comprising a plurality 
of documents, wherein at least a subset of the documents contain a plurality of 
links to other documents, each document being associated with a host, the 
computer program product comprising: 

a) computer-readable program code devices configured to cause a 
computer to initialize a host set; 

b) computer-readable program code devices configured to cause a 
computer to initialize a document set for each host in the host set; 

c) computer-readable program code devices configured to cause a 
computer to select at random a host from the host set; 

d) computer-readable program code devices configured to cause a 
computer to select at random a document from the document set of 
the selected host; and 

e) computer-readable program code devices configured to cause a 
computer to, responsive to the selected document containing at 
least one link: 

e.1) select at random a link from the selected document; 
e.2) select a document corresponding to the selected link; 
e.3) select a host corresponding to the selected document; and 
e.4) add the selected host to the host set; 
e.5) add the selected document to the document set of the 
selected host; and 
e.6) repeat the operations of e.1) through e.5) until all links have 
been traversed [[a first predetermined condition is met]]. 

36. (Original) The computer program product of claim 35, wherein: 

the computer-readable program code devices configured to cause a 
computer to add the selected host to the host set operate 
responsive to the selected host not being in the host set; and 

the computer-readable program code devices configured to cause a 
computer to add the selected document to the document set of the 
selected host operate responsive to the selected document not 
being in the document set of the selected host. 

37. (Currently amended) A computer program product comprising a 
computer-usable medium having computer-readable code embodied therein for 
randomly walking through a hypertext-linked document set comprising a plurality 
of documents, wherein at least a subset of the documents contain a plurality of 
links to other documents, each document being associated with a host, the 
computer program product comprising: 

a) computer-readable program code devices configured to cause a 
computer to initialize a host set; 

b) computer-readable program code devices configured to cause a 
computer to initialize a document set for each host in the host set; 

c) computer-readable program code devices configured to cause a 
computer to select at random a host from the host set; 

d) computer-readable program code devices configured to cause a 
computer to select at random a document from the document set of 
the selected host; 

e) computer-readable program code devices configured to cause a 
computer to randomly choose whether to select a random new 
document; and 

f) computer-readable program code devices configured to cause a 

computer to, responsive to [[non-occurrence of the random 
event]] choosing not to select a random new document, and further 
responsive to the selected document containing at least one link: 
[[e]]f.1) select at random a link from the selected document; 
[[e]]f.2) select a document corresponding to the selected link; 
[[e]]f.3) select a host corresponding to the selected document; 
and 
[[e]]f.4) add the selected host to the host set; 
[[e]]f.5) add the selected document to the document set of the 
selected host; and 
[[e]]f.6) repeat the operations of [[e]]f.1) through [[e]]f.5) until 
all links have been traversed [[a first predetermined condition is met]]. 

38. (Canceled). 

39. (Original) The computer program product of claim 35, wherein the 
hypertext-linked document set is the World Wide Web, and wherein each 
document is a web page. 

40. (Original) The computer program product of claim 39, wherein each host 
corresponds to a domain. 

41. (Original) A computer program product comprising a computer-usable 
medium having computer-readable code embodied therein for measuring relative 
quality of a search engine index, the computer program product comprising: 

a) computer-readable program code devices configured to cause a 
computer to perform a two-level random walk among documents 
within a document set; 

b) computer-readable program code devices configured to cause a 
computer to, for each document encountered in the random walk, 
determine whether the document is indexed by the search engine 
index; and 

c) computer-readable program code devices configured to cause a 
computer to aggregate the results of the operations of b). 

42. (Currently amended) The computer program product of claim 41, wherein 
at least a subset of the documents contain a plurality of links to other documents, 
each document being associated with a host, and wherein the computer-readable 
program code devices configured to cause a computer to perform a two-level 
random walk comprise: 

a.1) computer-readable program code devices configured to cause a 

computer to select a host; 
a.2) computer-readable program code devices configured to cause a 

computer to select at random a document associated with the host; 
a.3) computer-readable program code devices configured to cause a 

computer to retrieve the selected document; 
a.4) computer-readable program code devices configured to cause a 

computer to select at random a link in the retrieved document; 
a.5) computer-readable program code devices configured to cause a 

computer to retrieve a document referenced by the selected link; 

and 

a.6) computer-readable program code devices configured to cause a 
computer to repeat the operations of a.4) and a.5) until all 
links have been traversed. 

43. (Canceled). 

44. (Currently amended) The computer program product of claim 41, wherein 
at least a subset of the documents contain a plurality of links to other documents, 
each document being associated with a host, and wherein the computer-readable 
program code devices configured to cause a computer to perform a two-level 
random walk comprise: 

a.1) computer-readable program code devices configured to cause 

a computer to initialize a host set; 
a.2) computer-readable program code devices configured to cause a 

computer to initialize a document set for each host in the host set; 
a.3) computer-readable program code devices configured to cause a 

computer to select at random a host from the host set; 
a.4) computer-readable program code devices configured to cause a 

computer to select at random a link from a document in the 

document set of the selected host; 
a.5) computer-readable program code devices configured to cause a 

computer to add [[the]] a host referenced by the link to the host set; 
a.6) computer-readable program code devices configured to cause a 

computer to add [[the]] a document referenced by the link to the 

document set of the selected host; 
a.7) computer-readable program code devices configured to cause a 

computer to, responsive to the selected document containing at 

least one link: 

a.7.1) select at random a link from the selected document; 
a.7.2) select a document corresponding to the selected link; 
a.7.3) select a host corresponding to the selected document; and 
a.7.4) repeat the operations of a.5) through a.8) until all 
links have been traversed; 

and 

a.8) computer-readable program code devices configured to cause a 
computer to, responsive to the selected document not containing at 
least one link, repeat the operations of a.3) through a.6), and further 
conditionally repeating a.7) or a.8), until all documents have been 
traversed. 

45. (Original) The computer program product of claim 44, wherein: 

the computer-readable program code devices configured to cause a 

computer to add the selected host to the host set are configured to 

cause a computer to add the selected host responsive to the 

selected host not being in the host set; and 
the computer-readable program code devices configured to cause a 

computer to add the selected document to the document set of the 
selected host are configured to cause a computer to add the 
selected document responsive to the selected document not being 
in the document set of the selected host. 

46. (Previously presented) The computer program product of claim 41, 
wherein each document contains a plurality of words, and wherein the computer- 
readable program code devices configured to cause a computer to determine 
whether the document is indexed by the search engine index comprise computer- 
readable program code devices configured to, for each document encountered in 
the random walk: 

b.1) select at least one word from the document; 

b.2) perform a query on the search engine index based on the selected 

at least one word, to obtain search results; and 
b.3) determine whether the document is included in the obtained search 

results. 

47. (Original) The computer program product of claim 46, wherein the 
computer-readable program code devices configured to select at least one word 
from the document comprise computer-readable program code devices 
configured to select at least one word based on rarity. 

48. (Previously presented) A computer program product comprising a 
computer-usable medium having computer-readable code embodied therein for 
measuring relative quality of a target document in a document set, the computer 
program product comprising: 

computer-readable program code devices configured to cause a computer 
to perform a two-level random walk among documents within a 
document set; and 

computer-readable program code devices configured to cause a computer 
to determine a quality metric responsive to the number of times the 
target document is encountered in the random walk. 

49. (Previously presented) A computer program product comprising a 
computer-usable medium having computer-readable code embodied therein for 
measuring relative quality of a target document in a document set comprising a 
plurality of documents, wherein at least a subset of the documents contain a 
plurality of links to other documents, the computer program product comprising: 

computer-readable program code devices configured to cause a computer 
to perform a two-level random walk among documents within a 
document set; and 

computer-readable program code devices configured to cause a computer 
to determine a quality metric responsive to the number of 
documents encountered during the two-level random walk that link 
to the target document. 

50. (Previously presented) The computer program product of claim 49, 
wherein the computer-readable program code devices configured to cause a 
computer to determine a quality metric comprise computer-readable program 
code devices configured to cause a computer to determine a quality metric 
responsive to the number of documents that link to the target document, and 
responsive to the quality metric of the linking documents. 

51. (Previously presented) The computer program product of claim 49, 
wherein the computer-readable program code devices configured to cause a 
computer to determine a quality metric comprise computer-readable program 
code devices configured to cause a computer to determine a value for: 

$$R(p) = \frac{d}{T} + (1 - d) \sum_{i=1}^{k} \frac{R(p_i)}{C(p_i)}$$

where: 

R(p) is the PageRank of target document p; 

R(p_i) is the PageRank of document p_i; 

T is the total number of documents in the document set; 

d is a damping factor such that 0 < d < 1; 

documents p_1, ..., p_k each contain at least one link to target document p; 
and 

C(p_i) is the number of links out of document p_i. 
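
By way of illustration only, the value recited in claim 51 can be approximated by fixed-point iteration. The following is a minimal sketch, not the applicant's implementation: the function name `pagerank`, the dictionary-of-lists link representation, the default `d = 0.15`, and the iteration count are all assumptions introduced here. It further assumes each document's link list holds distinct targets, and it does not redistribute rank from documents that have no outgoing links.

```python
from typing import Dict, List


def pagerank(links: Dict[str, List[str]], d: float = 0.15,
             iterations: int = 50) -> Dict[str, float]:
    """Iterate R(p) = d/T + (1 - d) * sum_i R(p_i) / C(p_i).

    `links` maps each document to the documents it links to (hypothetical
    representation; distinct targets per document are assumed).
    """
    # Every document that appears as a source or as a link target.
    docs = sorted(set(links) | {t for outs in links.values() for t in outs})
    total = len(docs)                       # T in the formula
    rank = {p: 1.0 / total for p in docs}   # uniform starting estimate

    for _ in range(iterations):
        new_rank = {p: d / total for p in docs}        # the d/T term
        for p_i, outs in links.items():
            if not outs:
                continue                               # no links out of p_i
            share = (1 - d) * rank[p_i] / len(outs)    # (1-d) * R(p_i)/C(p_i)
            for p in outs:
                new_rank[p] += share
        rank = new_rank
    return rank
```

For a three-document set in which `a` and `b` link to each other and `c` links only to `a`, the sketch assigns `c` exactly d/T, since no document links to it, and ranks `a` above `b`.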

52. (Currently amended) A computer program product comprising a 
computer-usable medium having computer-readable code embodied therein for 
measuring relative quality of a target document in a document set comprising a 
plurality of documents, wherein at least a subset of the documents contain a 
plurality of links to other documents, and wherein each document is associated 
with a host, the computer program product comprising: 

computer-readable program code devices configured to cause a computer 
to perform a two-level random walk among documents within a 
document set, by: 

a.1) selecting a host; 

a.2) selecting at random a document associated with the host; 
a.3) retrieving the selected document; 

a.4) randomly choosing whether to select a random new 
document; 

a.5) responsive to choosing to select the random new document: 

a.[[4]]5.1) selecting at random a host from among the 
previously selected hosts; and 

a.[[4]]5.2) selecting at random a document associated 
with the host; and 

a.[[4]]5.3) retrieving the selected document; 
a.[[5]]6) responsive to choosing not to select the random new 
document: 

a.6.1) selecting at random a link in the retrieved 
document; and 

a.[[5]]6.2) retrieving a document referenced by the 
selected link; and 

a.[[6]]7) repeating the operations of a.4), and then 
conditionally repeating the operations of a.5) through 
a.5.3) or a.6) through a.6.2) depending upon the 
choosing made in a.4); and 

computer-readable program code devices configured to cause a computer 
to determine a quality metric responsive to the number of 
documents encountered during the two-level random walk that link 
to the target document. 
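
For illustration, the operations a.1) through a.7) of claim 52 and the link-based quality metric can be sketched as follows. This is a hedged example under stated assumptions, not the claimed implementation: the in-memory `web` dictionary stands in for document retrieval, the walk starts from a caller-supplied seed URL rather than a randomly selected host and document, the names `two_level_random_walk`, `link_count_metric`, and `jump_probability` are hypothetical, and falling back to a jump when a document has no links is a choice made here that claim 52 does not recite.

```python
import random
from typing import Dict, List, Optional, Set
from urllib.parse import urlparse


def two_level_random_walk(web: Dict[str, List[str]], start_url: str,
                          steps: int, jump_probability: float = 0.15,
                          rng: Optional[random.Random] = None) -> List[str]:
    """Return the documents retrieved, in order, during the walk."""
    rng = rng or random.Random()
    docs_by_host: Dict[str, Set[str]] = {}   # host -> documents seen on it
    visited: List[str] = []

    def retrieve(url: str) -> List[str]:
        # Record the document under its host and return its outgoing links.
        docs_by_host.setdefault(urlparse(url).netloc, set()).add(url)
        visited.append(url)
        return web.get(url, [])

    links = retrieve(start_url)                        # a.1) to a.3), seeded
    for _ in range(steps):
        jump = rng.random() < jump_probability         # a.4)
        if jump or not links:                          # dead end: assumption
            host = rng.choice(sorted(docs_by_host))    # a.5.1) pick a host
            url = rng.choice(sorted(docs_by_host[host]))  # a.5.2) pick a doc
        else:
            url = rng.choice(links)                    # a.6.1) pick a link
        links = retrieve(url)                          # a.5.3) or a.6.2)
    return visited


def link_count_metric(web: Dict[str, List[str]], visited: List[str],
                      target: str) -> int:
    """Count distinct encountered documents that link to `target`."""
    return sum(1 for url in set(visited) if target in web.get(url, []))
```

Selecting the host first and then a document within it is what makes the walk two-level: a host with many documents is no more likely to be chosen on a jump than a host with one. Counting distinct documents, rather than every encounter, is likewise an assumption of this sketch.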

53. (Currently amended) A computer program product comprising a 
computer-usable medium having computer-readable code embodied therein for 
measuring relative quality of a target document in a document set comprising a 
plurality of documents, wherein at least a subset of the documents contain a 
plurality of links to other documents, wherein each document is associated with a 
host, the computer program product comprising: 

computer-readable program code devices configured to cause a computer 
to perform a two-level random walk among documents within a 
document set, by: 

a.1) initializing a host set; 

a.2) initializing a document set for each host in the host set; 

a.3) selecting at random a host from the host set; 

a.4) randomly choosing whether to select a random new host; 
a.5) responsive to choosing to select the random new host: 

a.[[4]]5.1) selecting at random a host from among the 
previously selected hosts; 

a.[[5]]6) responsive to choosing not to select the random new host: 

a.[[5]]6.1) selecting at random a document from the 
document set of the selected host; 

a.[[5]]6.2) adding the selected host to the host set; 

a.[[5]]6.3) adding the selected document to the document 
set of the selected host; 

a.[[5]]6.4) responsive to the selected document 
containing at least one link: 

a.[[5]]6.4.1) selecting at random a link from the 
selected document; 

a.[[5]]6.4.2) selecting a document corresponding to 
the selected link; 

a.[[5]]6.4.3) selecting a host corresponding to the 
selected document; and 

a.[[5]]6.4.4) repeating the operations of a.[[5]]6.2) 
through a.[[5]]6.4.3) until all links 
have been traversed; and 

a.[[6]]7) repeating the operations of a.3) through a.4), and 
then conditionally repeating a.5) through a.5.1) or a.6) 
through a.6.4.4); and 

computer-readable program code devices configured to cause a computer 
to determine a quality metric responsive to the number of 
documents encountered during the two-level random walk that link 
to the target document. 

54. (Previously presented) The computer program product of claim 49, further 
comprising: 

c) computer-readable program code devices configured to cause a 
computer to determine a quality metric for at least one additional 
target document; and 

d) computer-readable program code devices configured to cause a 
computer to rank the quality metric of the first target document with 
respect to the quality metrics of the additional target documents. 

55. (Currently amended) A computer program product comprising a 
computer-usable medium having computer-readable code embodied therein for 
randomly walking through a hypertext-linked document set comprising a plurality 
of documents, wherein at least a subset of the documents contain a plurality of 
links to other documents, each document being associated with a host, the 
computer program product comprising: 

a) computer-readable program code devices configured to cause a 
computer to select a host; 

b) computer-readable program code devices configured to cause a 
computer to select at random a document associated with the host; 

c) computer-readable program code devices configured to cause a 
computer to retrieve the selected document; 

d) computer-readable program code devices configured to cause a 
computer to randomly choose whether to select a random new 
host; 

e) computer-readable program code devices configured to cause a 
computer to, responsive to choosing to select the random new host: 

[[d]]e.1) select at random a new host from among the 
previously selected hosts; and 

[[d]]e.2) repeat the operations of b) through d) and then 
conditionally e) through e.2) or f) through f.3) until all 
documents have been traversed; and 

[[e]]f) computer-readable program code devices configured to cause a 
computer to, responsive to choosing not to select the random new host: 

[[e]]f.1) select at random a link in the retrieved document; 

[[e]]f.2) retrieve a document referenced by the selected link; 
and 

[[e]]f.3) repeat the operations of d), and then conditionally e) 
through e.2) or f) through f.3) until all links have been traversed. 

56. (Currently amended) A computer program product comprising a 
computer-usable medium having computer-readable code embodied therein for 
measuring relative quality of a target document in a document set comprising a 
plurality of documents, wherein at least a subset of the documents contain a 
plurality of links to other documents, the computer program product comprising: 

a) computer-readable program code devices configured to cause a 
computer to perform a two-level random walk among documents 
within a document set by: 

a.1) initializing a host set; 

a.2) initializing a document set for each host in the host set; 

a.3) selecting at random a host from the host set; 

a.4) randomly choosing whether to select a random new host; 

a.5) responsive to choosing to select a random new host: 

a.[[4]]5.1) selecting at random a new host from among 
the previously selected hosts; 

a.[[5]]6) responsive to choosing not to select the random new host: 

a.[[5]]6.1) selecting at random a link from a document in 
the document set of the selected host; 

a.[[5]]6.2) adding the host referenced by the link to the 
host set; 

a.[[5]]6.3) adding the document referenced by the link to 
the document set of the selected host; 

a.[[5]]6.4) responsive to the selected document 
containing at least one link: 

a.[[5]]6.4.1) selecting at random a link from the 
selected document; 

a.[[5]]6.4.2) selecting a document corresponding to 
the selected link; 

a.[[5]]6.4.3) selecting a host corresponding to the 
selected document; 

a.[[5]]6.4.4) repeating the operations of a.[[5]]6.2) 
through a.[[5]]6.4.3) until all links 
have been traversed; and 

a.[[6]]7) responsive to the selected document not containing at 
least one link, repeating the operations of a.3) through a.4), 
and then conditionally repeating a.5) through a.5.1) or a.6) 
through a.6.4.4); 

b) computer-readable program code devices configured to cause a 
computer to determine a quality metric responsive to the number of 
documents encountered during the two-level random walk that link 
to the target document; 

c) computer-readable program code devices configured to cause a 
computer to determine a quality metric for at least one additional 
target document; and 

d) computer-readable program code devices configured to cause a 
computer to rank the quality metric of the first document with 
respect to the quality metrics of the additional target documents. 

57. (Currently amended) A system for randomly walking through a hypertext- 
linked document set comprising a plurality of documents, wherein at least a 
subset of the documents contain a plurality of links to other documents, each 
document being associated with a host, the system comprising: 

a) a host selector; 

b) a random document selector, coupled to the host selector, for 
selecting at random a document associated with the host; 

c) a document retriever, coupled to the random document selector, for 
retrieving the selected document; and 

d) a link selector, coupled to the document retriever; 

wherein, responsive to the host selector randomly choosing to select a 
random host: 

the host selector selects at random a host from among the 
previously selected hosts; 

the random document selector selects at random a document 
associated with the host; and 

the document retriever retrieves the selected document; and 

wherein, responsive to the host selector randomly choosing not to 
select a random host: 

the link selector selects at random a link in the retrieved document; 
and 

the document retriever retrieves a document referenced by the 
selected link; and 

and wherein the link selector, the random document selector, and the 
document retriever repeat their respective operations until all 
links have been traversed. 

58. (Original) A system for measuring relative quality of a search engine 
index, comprising: 

a random walker, for performing a two-level random walk among 
documents within a document set; 

a determination module, coupled to the random walker, for, for each 
document encountered in the random walk, determining whether 
the document is indexed by the search engine index; and 

a results aggregation module, coupled to the determination module, for 
aggregating the results of the determination module. 
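
As a non-limiting illustration of the determination and aggregation modules of claim 58, the sketch below reports the fraction of encountered documents that the index contains. Aggregating as a simple fraction is an assumption made here, as are the function name `index_coverage` and the caller-supplied `is_indexed` predicate, which stands in for a query against the search engine index.

```python
from typing import Callable, Iterable


def index_coverage(visited: Iterable[str],
                   is_indexed: Callable[[str], bool]) -> float:
    """Fraction of encountered documents for which `is_indexed` is true."""
    docs = list(visited)
    if not docs:
        return 0.0                      # empty walk: nothing to aggregate
    # Determination step: one membership check per encountered document.
    hits = sum(1 for url in docs if is_indexed(url))
    # Aggregation step: reduce the per-document results to one number.
    return hits / len(docs)
```

Because the walk may encounter a document more than once, this sketch weights each document by how often it was encountered; deduplicating `visited` first would give an unweighted figure instead.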

59. (Previously presented) A system for measuring relative quality of a target 
document in a document set, comprising: 

a random walker, for performing a two-level random walk among 
documents within a document set; and 

a determination module, coupled to the random walker, for determining a 
quality metric responsive to the number of times the target 
document is encountered in the random walk. 
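
One possible reading of a quality metric "responsive to the number of times the target document is encountered" is the target's share of all encounters in the walk. The sketch below is offered under that assumption only; the names `visit_frequency` and `rank_by_visits` are hypothetical, and the ranking helper simply orders several target documents by encounter count.

```python
from collections import Counter
from typing import List


def visit_frequency(visited: List[str], target: str) -> float:
    """Share of all encounters in the walk that were of `target`."""
    if not visited:
        return 0.0
    return Counter(visited)[target] / len(visited)


def rank_by_visits(visited: List[str], targets: List[str]) -> List[str]:
    """Order `targets` from most to least frequently encountered."""
    counts = Counter(visited)           # missing targets count as zero
    return sorted(targets, key=lambda t: counts[t], reverse=True)
```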

60. (Previously presented) A system, comprising: 
a processor; and 

memory containing software executable by the processor; 
wherein, by executing the software, the processor initializes a document 
set, selects an arbitrary hyperlink included in a selected document 
in the document set, and adds a document referenced by the 
hyperlink to the document set. 

61. (Previously presented) The system of claim 60 wherein the processor 
further initializes a host set and adds a host referenced by the arbitrary hyperlink 
to the host set. 

62. (Previously presented) The system of claim 60 wherein the processor 
further determines whether the document referenced by the arbitrary hyperlink is 
included in a search engine index. 

63. (New) The method of claim 2, wherein the repeating of d) and the 
conditional repeating of e) or f) continues until all documents have been 
traversed. 

64. (New) The method of claim 7, further comprising: 

f) repeating c) through d), and further conditionally repeating e) if the 
selected document contains at least one link, until all documents 
have been traversed. 

65. (New) The method of claim 9, further comprising: 

g) repeating c) through e), and further conditionally repeating f) if a 
random new document is not chosen, until all documents have 
been traversed. 

66. (New) The method of claim 15, wherein the repeating of a.4) and the 
conditional repeating of a.4.1) through a.4.1.3) or a.4.2) through a.4.2.2) 
continues until all documents have been traversed. 

67. (New) The method of claim 24, wherein the repeating of a.4) and the 
conditional repeating of a.5) through a.5.3) or a.6) through a.6.2) continues until 
all documents have been traversed. 

68. (New) The method of claim 25, wherein the repeating of a.3) through a.4) 
and the conditional repeating of a.5) through a.5.1) or a.6) through a.6.2.6) 
continues until all documents have been traversed. 

69. (New) The method of claim 28, wherein the repeating of a.3) through a.4) 
and the conditional repeating of a.5) through a.5.1) or a.6) through a.6.2.6) 
continues until all documents have been traversed. 

70. (New) The computer program product of claim 30, wherein the computer- 
readable program code devices are further configured to continue to cause a 
computer to repeat a.4) and then conditionally repeat a.4.1) through a.4.1.3) or 
a.4.2) through a.4.2.2) until all documents have been traversed. 

71. (New) The computer program product of claim 35, further comprising: 

f) computer readable program code devices configured to cause a 
computer to repeat c) through d), and further conditionally repeat e) 
if the selected document contains at least one link, until all 
documents have been traversed. 

72. (New) The computer program product of claim 37, further comprising: 

g) computer readable program code devices configured to cause a 
computer to repeat c) through e), and further conditionally repeat f) 
if a random new document is not chosen, until all documents have 
been traversed. 

73. (New) The computer program product of claim 52, wherein the repeating 
of a.4) and the conditional repeating of a.5) through a.5.3) or a.6) through a.6.2) 
continues until all documents have been traversed. 

74. (New) The computer program product of claim 53, wherein the repeating 
of a.3) through a.4) and the conditional repeating of a.5) through a.5.1) or a.6) 
through a.6.4.4) continues until all documents have been traversed. 

75. (New) The computer program product of claim 56, wherein the repeating 
of a.3) through a.4) and the conditional repeating of a.5) through a.5.1) or a.6) 
through a.6.4.4) continues until all documents have been traversed. 